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Abstract 

We prove an interesting fact about Lottery: the winning 6 numbers (out of 49) in the game of the Lottery 
contain two consecutive numbers with a surprisingly high probability (almost 50%). 

1 Introduction 

The game of lottery exists and has been run in many countries (such as the UK, the US, Germany, France, 
Ireland, Australia, Greece, Spain, etc.) for a number of years. In this game, the player chooses m numbers from 
among the numbers 1, . . . , n > m, the order of the choice being unimportant and the values of n and m varying 
from country to country; the lottery organizers choose publicly m numbers in the same way, and if they are the 
same with the ones the player chose, the player wins. Newspapers usually publish the winning set of numbers 
along with statistics on the number of times each particular number from 1 to n has appeared in the winning 
set. It is however a slightly different and more elusive statistical observation that will be of interest to us here. 

Some people have noticed that, in the usual case m — 6 and n — 49, it happens very often that at least 
two of the winning numbers are "close" to each other. As 6 out of 49 is not really many, this seems at first to 
be paradoxical, if not altogether wrong, and may remind us strongly of another very similar famous paradox, 
the Birthday Paradox. In this work we will prove that this observation is well founded, even if we adopt the 
strictest interpretation of numbers being "close", i.e. that they be consecutive. Our problem to solve then will 
be the following: 

"What is the probability that, out o/m > numbers drawn uniformly randomly from the range 1, . . . ,n > m, at 
least two are consecutive?" 

We will calculate this probability in two ways below: one quite "mechanical", by finding a recursion and 
then solving it by means of generating functions, and one combinatorial, which will actually yield a more general 
result. We will also see that this problem, at least for the usual values m — 6 and n = 49, leads to a novel and 
unexpected gambling application. 

2 First solution 

Let f{n,m) be the number of ways in which m numbers can be chosen out of 1, . . . ,n so that no two are 
consecutive. For any particular choice, one of the following will hold: 

• Neither 1 nor n is chosen: we have to choose m numbers among 2, . . . , n — 1 and the number of ways this 
can be accomplished in is /(n — 2,m). 

• 1 and/or n is chosen: the number of ways this can be accomplished in is, according to the inclusion- 
exclusion principle, the sum of the number of ways of choosing 1 and choosing n minus the number of ways 
in choosing both. Observe now that 2 cannot be chosen if 1 is, and that n — 1 cannot be chosen if n is. 
Then, in the first two cases the number of choices is f{n — 2, m — 1), and in the last one /(n — 4, m — 2), 
so that the total number of choices if 1 and/or n is chosen is 2/(n — 2, m — 1) + f{n — 4, m — 2). 

Accordingly, summing both cases: 

fin, m) = fin - 2, m) + 2 fin - 2, m - 1) - /(n - 4, m - 2) 

In addition to the recursive formula above, we need some boundary conditions as well, corresponding to 
n = 0, 1, 2, 3 and m = 0, 1. They are provided by the following: 

• We can choose no numbers in only one way: /(n, 0) — 1, Vn > 0. 
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• We can choose one number in n ways: f{n, 1) = n, Vn > 0. 

• /(3,2) = 1 

Let us now write down the generating function for f{n,m): 

oo [2"] 



The upper boundary for m is determined by the fact that /(n, m) = if m > +1. 

00 ["2 1 

By multiplying the recursion formula by z^w"^, and applying the operator ^ ^ , we get: 



where 



F{n,m) = Fi(n,m) + 2F2 (n,m) — Fz{n,m) 



00 r 2 1 

Fi(n,m) = ^^/(n-2,m)zV 
F2(n,m) = ^ ^ /(n-2,m-l)aV 

00 [t] 

F3(n,m) = ^ ^ /(n-4,m-2)aV 



For each of the three functions, we get 

n=2 m=2 n=2 m=2 



n=4 m=2 



^^/(n,m)^"«;'" + /(3,2)^3«; 

= 0^ [F{z,w) + z^w^] 



rti- 



00 r 2 1 



n=2 m=2 



00 r 2 1 00 

Yl "^)^""'"' + 2)/«;^ + ^ f{n, l)z^ 
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F{z, w) + z^w^ + w ^ nz" 



00 [2]"*"^ 00 [2! 00 [2! 



n=0 m— 2 
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F(2, w) + z^w'^ + w X) na" + X) 
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We still need three auxiliary computations: 



Putting all of the above together, and after some further algebraic simplifications, we find: 



F[z, w) — w z 



2.48 + 2(2- 3 + 1)^) 



(2 — 1)^(1 — 2 — wz^) 



Of course, this is not the full generating function, as the cases n = 1, 2, 3 and m = 0, 1 are entirely missing; we 
omitted them in order to avoid to have to deal with "weird" boundary conditions such as /(— 3, —1) etc. But 
now we can add them back. 

Remember that /(n, 0) = 1, n > and f{n, 1) = n, n > 1; but we have already carried out the relevant 
computations as auxiliary computations above. Therefore: 

32 1 2™ 
T(z,w) = F(z,w) + z w +- 1-7- 

i — 2 (t ~ 2J^ 

where the first fraction is the generating function for f{n,0) and the second for /(n, 1). After some algebraic 
simplifications, we find: 

1 + zw 1 + zw 1 2(1 + 2w) l^-^r n , M" 

T(z, w) = — — ^— —— = — > [2(1 + wz)] — 

1 — Z — wz-' 1 — Z — wz 2 1 — 2(1+ wz) 2 

n — 1 

oc n / \ oooo/ 

Ey^ / n \ ^n+rn-l^m _ ST^ ST^ I n — m+1 
^ \ m ) ^ ^ \ ra 

n=\ Tn— m— n— m 

SO that 

J,, N / n - m + 1 
f(n,m) = 



m 

If then we draw m numbers from the range 1, . . . , n, the probability no two are consecutive is: 



q{n,m) = 

so that the solution to our original problem is: 

p(n, m) — 1 



n — m + 1 
m 

n 
m 



n — m + 1 
m 

n 
m 



We should note here that a proof of the formula for /(n, m) based on induction appears in 



3 Second solution 

The second solution, combinatorial in nature, allows us to solve a more general problem: in how many ways 
/fc(n, m) can we choose m numbers among the numbers 1, . . . , n so that the minimum distance between any two 
of our choices (which we will be calling the distance of our choice) is fc > 0? There is a very simple formula for 
that. 

Imagine we have numbered n balls with the numbers 1, . . . ,n, and that we have chosen the numbers 1 < 
A''i < ... < Nm < n. For every number chosen but the last one, remove the numbers of the m — 1 balls 
immediately following it; as for the remaining balls, renumber them consecutively and in the order they are. We 
will end up with n~{k — l)(m — 1) balls numbered consecutively from 1 to n — (fc— l)(m— 1), and (k — l)(m — 1) 
blank ones. This final situation will not depend on the balls we chose originally, although the exact positioning 
of the blank balls among the numbered ones will. Notice finally that the original number of every ball can be 
recovered: it is the number of balls preceding it, including itself! 



Any valid choice of m numbers in the original numbering will correspond to a choice of m numbers after 
renumbering, and vice versa: after we choose m numbers between 1 and n — {k — l)(m — 1), we insert blanks as 
described above and renumber, getting a valid choice of numbers in the original numbering. This correspondence 
is obviously bijective. Therefore, 

,. / X , n — (fc — l)(m — 1) . , , ^ , 

/fc(n,m)= ^ '],n>m>l,k>l 



For fc = 2 we recover the result of our first solution, and hence the same probability p{n, m) of at least two 
choices being consecutive. We also obtain the more general formula 



Pk(n,m) = 1 



n - (fc - l)(m - 1) 
rn 



n 
m 

for the probability that at least two of the winning numbers have a distance less than k. 



4 Application in gambling 

The probability p{n, m) can actually be quite large, maybe unexpectedly large: for example, for the usual values 
n = 49 and m = 6, we find p(49, 6) ~ 0.495198. Therefore, the observation that the winning six numbers of the 
lottery often contain two that are very close is well founded; in almost one game out of two the winning set of 
numbers contains two consecutive ones! 

Moreover, as p(49, 6) is very close to 0.5, the problem we just studied can be turned into a successful casino 
game: the player bets €e that 6 numbers randomly chosen among 1, . . . , 49 will contain at least two consecutive 
ones. If this happens, the player gets €e from the house, otherwise the house wins the player's money. This 
game is almost fair, as the player has an almost 50% chance to win; but he actually has slightly less than that, 
and this gives the house a (profitable) advantage! 



5 A slight variant 

What would happen, though, if the player suggests that numbers 1 and n be treated as consecutive as well, 
namely if we order the numbers on a ring instead of a line? There should now be fewer possible choices for 
non-consecutive numbers. Indeed, let now gk{n,m) be the number of possible choices of m < n among n > 
numbers so that the minimum distance between any two of the chosen ones is k; in other words, among any two 
chosen numbers, with the property that no number between them is chosen, there are at least k — 1 numbers 
lying between them. Then, we can split the choices into those in which one number among 1, . . . , fc — 1 is chosen, 
and those in which this is not the case: 

• If one ball among 1, . . . , fc — 1 is chosen, then the remaining m — 1 balls can be chosen among n — 2fc + 1 
balls (we exclude the chosen ball and the k — 1 adjacent balls on either side); but now, by removing a 
block of 2fc — 1 balls from the circle, we turn it into a line, so the total number of choices, for a fixed choice 
within 1, . . . , fc, is fk{n — 2k + l,m — 1); and since every different choice within 1, . . . , fc leads to different 
possible choices, the total number of choices in this category is (k — l)fk{n — 2k + l,m ~ 1). 

• If no ball is chosen among 1, . . . , fc — 1, then we can just remove them, turn the circle into a line, and 
renumber: we need to choose m balls among the remaining n — fc + 1, obeying the distance restrictions, 
and this can happen in fk{n — k + l,m) ways. Therefore, 



gk{n, m) = (fc — l)/fe(n — 2fc + 1, m — 1) + /^(n — fc + 1, m), n>m>l,fc>0 
If we define now 



pfc(n,m) 



^ ^ _ gk{n,m) ^ ^ _ 



n-fc + l-(fc- l)(m -1)\ /n-2fc + l-(fc- l)(m - 2) 
m / V yn — 1 



n \ I n 

ml \ m 



we find that p2(49,6) = p(49, 6) ~ 0.503203. Therefore, if some casino agreed to play this variant of the game 
with a player, the player would have a slight advantage over the house, and the latter would loose money! 
Table gives the values of pfe(49, 6) and pfc(49, 6) for fe G N*: 
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k 


Pfc(49,6) 


Pfe(49,6) 


1 








2 


0.495198 


0.503203 


3 


0.766686 


0.806793 


4 


0.903824 


0.937157 


5 


0.966031 


0.984296 


6 


0.990375 


0.997447 


7 


0.99806 


0.999821 


8 


0.999785 


0.999999 


9 


0.999994 


1 


> 10 


1 


1 



Table 1: The probabilities that the winning set of numbers of the standard Lottery has a minimum distance k. 
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